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Systematic Gene Search in the Incy f,5;^j^gl>r05^ Database 



Normal tissue ^ 
-50,000 individual ESTs 



Tumor tissue 

-50,000 individual ESTs 



Priority list 
High 

Prostate 

Breast 

Ovary 

Bladder 

Uterus 

Low 



Iterative assembling 
with 

increasing mismatch 



•8,000 contigs 
+ 

-25 , 000 individual 
sequences 



-8,000 contigs 
+ 

-25, 000 individual 
sequences 



Comparison of databases 



normal tissue- 
specific 

( expected : 100-500) 



nonspecif ically 
expressed genes 



tumor tissue- 
specific 
(expected: 100-500) 



Genes of Interest 



Figure 1 
REPLACEMENT PAGE (RULE 26) 
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•-50,000 ESTs of a tissue (e.g.: uterus tumor) 



GAP4 Assembly 1st Round: 
minimum initial match: 20 
maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 0 



GAP4 Assembly 2nd Round: 
minimum initial match: 20 
maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 1 



GAP4 Assembly 3rd Round: 
minimum initial match: 20 
maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 2 



GAP4 Database 1 
Contigs 1 

Individual sequences 1 



unassembled 
ESTs 



GAP4 Database 2 
Contigs 2 

Individual sec[uences 2 



unassembled 
ESTs 



GAP4 Database 3: 
Contigs 3 

Individual sequences 3 



unassembled 
ESTs 



Figure 2bl 
REPLACEMENT PAGE (RULE 26) 



09/646778 



4/10 



GAP4 Database 3 : 
Contigs 3 

Individual sequences 3 



unassesabled 
ESTs 



Consensus 3 



GAP4 Assembly 4th Round: 
minimum initial match: 20 
maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 2 



GAP4 Database 4 : 
Contigs 4 

Individual sequences 4 



unassembled 
ESTs 



Consensus 4 



GAP4 Asseiobly 5th Round: 
minimum initial match: 2 0 
maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 4 



GAP4 Database 5: 
Contigs 5 

Individual sequences 5 



unassembled 
ESTs 5 



Consensus 5 



Individual sequences 5 



Figure 2b2 
REPLACEMENT PAGE (RULE 26) 
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Consensus 3 



Individual secfuences 5 



Consensus 4 



unassembled 
ESTs 5 



Consensus 5 



GAP4 Assembly 6th Round: 
minimum initial match: 20 
maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 4 



Assembled database 
of a specific tissue 
(e.g.: uterus tumor) 



REPLACEMENT PAGE (RULE 26) 



Figure 2b3 
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Assembled database 
of a specific tissue 
(e.g.: uterus tumor) 



Consensus 6 



Read-in as individual sequences 



Database Database of a second 

of a specific tissue specific tissue 

(e.g.: uterus tumor) (e.g.: normal uterus) 



GAP4 Assembly 

minimum initial match: 20 

maximum number of inserted 

empty spaces per sequence: 8 
maximum percent mismatch: 4 



Tumor tissue- Non-tissue- Normal tissue- 

specific ESTs specific ESTs specific ESTs 

Fig. 2b4 



REPLACEMENT PAGE (RULE 26) 
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In silico subtraction of gene expression in various tissues 



'30,000 consensus sequences 
normal tissue 



-30,000 consensus sequences 
tumor tissue 



Assembly at 4% mismatch 



Normal tissue 
Specific genes 



Cancer tissue 
Specific genes 



Genes expressed in both tissues 



Figure 3 
REPLACEMENT PAGE (RULE 2 6) 
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Genes of interest 



Determination of tissue-specific expression 
via electronic Northern (INCYTE LifeSeg and 
public EST databases) 



Candidate genes for tumor suppressors or 
tumor activators 



Figure 4a 



REPLACEMENT PAGE (RULE 26) 



;3£SS0H«0#.Hii.. # 09/646778 



WHILE C. > C,..^ 



9/10 



Partial cDNA sequence 
e.g., EST or contig 
S 



. . .GCCTCAAGTTATC. . . 



Electronic Northern Blot 



Fisher's Exact Test IF Hq 

EXIT 



Automatic Lengthening 



Consensus sequence C 



. ATGTCCTAGCCTCAAGTTATCAGATGCAA . 
Figure 4b 
REPLACEMENT PAGE (RULE 26) 



9d 
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Isolation of genomic BAG and PAC clones 



Chromosomal clone localization via FISH 



Hybridization signal 



Sequencing of clones that are located in regions that have 
chromosomal deletions in prostate and breast cancer leads to 

identification of candidate genes 



Exon Intron 



Confirmation of candidate genes by screening of 
mutations and/or deletions in cancer tissues 



Figure 5 
REPLACEMENT PAGE (RULE 26) 



